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Description 



Engineering of Controlled Deamidation 
Rates in Peptides, Proteins, and Similar 

Structures 

Background of Invention 

[0001] The deamidation of peptides and proteins as well as 

molecules related to peptides and proteins is a well known 
phenomenon. In this reaction Asn or Gin residues are 
gradually changed into Asp and Glu residues and their 
isomers respectively. The rate of this reaction is depen- 
dent on the primary sequence, three-dimensional struc- 
ture, pH, temperature, buffer type, ionic strength and 
other solution properties. The half-time varies from less 
than 1 day to more than a century. The reaction intro- 
duces a negative charge into the molecule. In addition, the 
isomerization products R-Asp and R-Glu as well D- 
isomerized forms and chain cleavage also accompany the 
reaction. 

[0002] jhe stability of Asn and Gin in pharmaceutical and other 



types of commercial preparations is a major field of study. 

Efforts have been made to discover formulation conditions 

that will minimize the rate of deamidation of amides in 

these preparations. There is also commercial potential in 

induced or controlled deamidation as an active aspect of 

the product. 
Summary of Invention 

[0003] The inventions described here pertain to the engineering 
of peptides, hormones, and proteins as well as peptide- 
like, hormone-like and protein-like molecules. 

[0004] It has been found in peptides that for sequences of the 

type AsnXxx and GInXxx, where Xxx is any natural or un- 
natural amino acid, the rate of deamidation of either Asn 
or Gin depends very strongly on the identity of Xxx. These 
results are applicable to peptides, proteins and hormones 
as well as any amide-containing molecule with similar 
structure. It is also applicable to isomerization of AspXxx 
and GluXxx sequences. 

[0005] I have done extensive work showing the quantitative se- 
quence dependence of these reactions. I have also in- 
vented a method for applying this sequence dependence 
to proteins, peptides, and other similar molecules, in con- 
junction with their three-dimensional structures. 



[0006] These inventions allow the prediction of deamidation rates 
of amides if the three-dimensional structure in the vicinity 
of the amide is known or predictable. They also provide 
quantitative information about the parameters that make 
up these rates and show which structural elements are 
important for each rate. 

[0007] These inventions can be used to modify structural ele- 
ments to provide stability or controlled instability in 
amides or acids in pharmaceutical and other types of 
commercial preparations. Specifically there are three ma- 
jor types of modifications that can be made that will 
change the rate by amounts that can be quantitatively or 
qualitatively determined from these inventions. Asp and 
Glu residues also undergo reactions controlled in this way. 

[0008] 1, |\/iodification of the residue or residue-like structure to 
the right or left of the amides or acids. This can be done 
by substitution of a different natural or non-natural amino 
acid side chain. 

[0009] 2. Exchange of Asn for Gin or Gin for Asn. Gin deamida- 
tion and probably Glu isomerization is substantially slower 
by a quantitative amount. 

[oo^o] 3, |\/iodification of other surrounding structural elements 
that affect the rate of the reaction as determined by my 



current three-dimensional calculation procedure or a sim- 
ilar procedure resulting from improvements in the current 
method. 

[0011] These inventions allow the engineering of molecules with 
specific amide structures that will deamidate at specified 
rates. These procedures can be used to design stable and 
unstable forms for pharmaceutical, industrial, and other 
products. This can be used to increase the shelf-life of 
such products through minor modifications, prevent or at 
least slow down the gradual formation of impurities in 
preparations with these modifications, and may make 
possible as a result of minor modifications the use of 
products that would otherwise be too unstable for practi- 
cal purposes. The engineering of products with unstable 
amides that are programmed to deamidate at specific 

rates is also a valuable application of this procedure. 
Detailed Description 

[001 2] General Method: 

[0013] It vvas known before the invention of the method de- 
scribed here, that the sequence and structure around an 
amide has a large effect on the deamidation rate. What 
was unknown was how this effect worked or any quantita- 



tive information tliat would allow engineering of stable 
amides or amides with predetermined rates based on 
modification of the sequence. 

[0014] Experiments which I carried out showed quantitatively the 
effects of sequence dependent deamidation. One of the 
discoveries made was that the sequence dependence of 
deamidation is much richer and covers a wider range than 
previously thought. In SZ'C, pH 7.4, 0.15 M Tris buffer, 
the combination of XxxAsnYyy and XxxGlnYyy sequences 
where Xxx and Yyy are any of the naturally occuring 
amino acids covers a range from less than 1 day to over 
15,000 days with the entire range in between available. 

[0015] In addition to the sequence dependent work, I have also 

invented methods that allow application of this sequence 
dependent data to three-dimensional protein structures 
to permit the prediction of protein deamidation rates. This 
method is applicable to any peptide type structures in- 
cluding peptides, hormones, and proteins and peptide- 
like, hormone-like, and protein-like molecules, as well as 
similar structures that deamidate in the same way. 

[0016] This prediction procedure is based on identifying struc- 
tural elements in a protein or similar molecule that con- 
tribute to the rate in known quantitative ways. These in- 



elude, but are not limited to, hydrogen bonds of various 
types, disulfide bonds, alpha-helices, and beta-sheets. 
The effect of each structure depends on a variety of quan- 
titative factors. 

[0017] jhe invention of these prediction techniques had never 

been attempted before. Not only do they allow prediction 
of deamidation rates to very high reliability, but the calcu- 
lation shows what structural features are responsible for 
each particular rate and what changes would be necessary 
to modify the rate in a quantitative manner. 

[0018] Sequence Dependence: 

[0019] Tables 1 and 2 show the sequence dependence of deami- 
dation measured using natural amino acid variations in 
pentapeptides. Non-natural variations provide an even 
greater range of sequences to choose from. 

[0020] Table 1 describes the sequence dependence of Asn se- 
quences. It is based on pentapeptide rates measure in 
37°C, pH 7.4, 0.15 M Tris buffer. The applicability of a 
pentapeptide model to sequence dependence was verified 
in a separate set of experiments. All values listed in this 
table are experimental except for the four values in 
boxes, which were estimated from the rest of the data. 
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[0021] Table 2 describes the sequence dependence of Gin pep- 



tides. It is also based on pentapeptide rates measure in 
37°C, pH 7.4, 0.15 M Tris buffer. In this case, the 52 val- 
ues shown in bold were measured, and the rest of the val- 
ues were derived from surface fitting. 
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[0022] Deamidation rates are affected by a wide variety of pa- 



rameters, including, pH, Temperature, Ionic Strength, and 



Buffer Ions. These rates are measured under pH and Tem- 
perature conditions tliat are applicable to biological sys- 
tems. The buffer type and concentration were chosen to 
minimize ion affects to the extent possible given the ex- 
perimental limitations. Modification of these conditions 
will change the rates in Tables 1 and 2. As long as the 
conditions are not taken to extremes (i.e. high tempera- 
ture, or strongly acidic or basic conditions) the qualitative 
sequence dependence should remain the same and the 
rates reported here can be used with necessary adjust- 
ments. 

[0023] It is also clear that direct hydrolysis of Gin and Asn take 

place in addition to the regular sequence dependent 
mechanism. This hydrolysis is sequence dependent as 
well, but an average value of about an 8010 day half-time 
can be taken as a rough approximation based on this and 
other data measured at the same time. This does not ef- 
fect the Asn rates significantly, but is responsible for the 
leveling off of the Gin rates at around this level. This hy- 
drolysis is also effected by the reaction conditions. 

[0024] The sequence dependence apparent in Tables 1 and 2 is 
of great value in engineering stable amides, unstable 
amides, or amides with particularly desired rates. Isomer- 



ization of acid residues will follow a very similar sequence 
dependence, offset by a determinable amount. 

[0025] qiyi Asn Deamidation: 

[0026] It is apparent from the data shown in Tables 1 and 2 that 
the deamidation rates of Asn and Gin cover markedly dif- 
ferent ranges. One of the discoveries in these experiments 
was that their sequence dependencies are complementary. 
Asn sequences cover the range from about 1 day to 450 
days. Gin picks up at 560 days and carries these rates out 
to tens of thousands of days. 

[0027] jhis opens up a new possibility for engineering of amide 

rates. It is possible to switch half-time ranges simply by 
substituting Asn for Gin or Gin for Asn depending on the 
desired effect. In many cases where it is desirable to in- 
troduce or leave in place an amide, the difference of one 
CH group in chain length may not be critical. 

[0028] Moreover, the fact that this range switching can be done 

raises another possibility. Other modifications of Gin and 
Asn may lie in different ranges. Thus the substitution of 
unnatural amide side-chains is also a valuable procedure. 

[0029] Three-Dimensional Effects of Deamidation: 

[0030] jhe invention of the three-dimensional prediction method 



for deamidation rates has been developed in two phases. 
The first of these was the invention of a technique for de- 
termining deamidation rates in proteins based on manu- 
ally counting the number of each type of structure that 
can affect the rate. Each of these effects is then summed 
with special coefficients to produce the correct rate. The 
procedure was calibrated on known relative deamidation 
rates and then found to be quite accurate in predicting 
absolute rates. 

[0031] Secondly, the procedure was adapted to an automated 

method by means of an extensive C++ program. Some 
modifications were made when this was done, but the ba- 
sic procedure remained the same. 

[0032] I am not attempting to patent this C++ program. There 
are many ways to write such programs and the current 
version is protected by copyright. What is being patented 
is the method used to write it which is based on the man- 
ual procedure and minor modifications and improvements 
that are particularly adapted to computerized calculation 
and include many conceptual innovations. 

[0033] It will be obvious to anyone who studies and understands 
these methods that there are variations in the procedure 
and even some improvements that could be made which 



would yield similar results. Any such modifications are 
understood to be products of this invention and come un- 
der the scope of this patent. 
[0034] The deamidation coefficient, C , for and amide is defined 
as:C =(0.01)(t Xe^^^"^' ^^n. Sn)^ 

D pl/2 

[0035] Here t is the pentapeptide primary structure half life, C 

1/2 m 

is a structure proportionality factor, C is the 3D struc- 

Sn 

ture coefficient for the nth structure observation, S is that 

n 

Observation, and f(C , C . S ) = C [(C )(S ) + (C )(S ) 

mSnn mSll S22 

+ (C )(S ) - (C )(S )/(S ) + (C )(S ) + (C )(S ) + (C 

S 3 3 S 4,5 4 5 S 6 6 S 7 7 S 

)(S ) + (C )(S ) + (C )(1 - S ) + (C )(5 - S ) + (C 

8 8 S 9 9 S 10 10 S 11 11 S 

)(5 - S )]. The structure observations, S , were selected 

12 12 n 

as those most lil<ely to impede deamidations, including 
hydrogen bonds, a-helices, P-sheets, and peptide inflexi- 
bilities. The functional form of C assumes that each of 

D 

these structural factors is added to the reaction activation 
energy. The observed S were: 
[0036] For Asn in an a-helical region: 

[0037] s = distance in residues inside the a-helix from the NH 

1 2 

end, where = 1 designates the end residue in the helix, 
2 is the second residue, and 3 is the third. If the position 
is 4 or greater, S = 0. 
[0038] s = distance in residues inside the a-helix from the 

2 



COOH end, where S = 1 designates the end residue in 
the helix, 2 is the second residue, and 3 is the third. If the 
position is 4 or greater or 0, then S = 0. 
[0039] s = 1 if Asn is designated as completely inside the a- 

helix, because it is 4 or more residues from both ends. If 
the Asn is completely inside, S = 1, S =0, and S = 0. If 
S 0 or S =f-0, then S = 0. 

1 2 3 

[0040] por flexibility of a loop including Asn between two adja- 
cent antiparallel ^sheets: 
[0041] s = number of residues in the loop. 

[0042] s = number of hydrogen bonds in the loop. S > 1 by 

definition. 
[0043] For hydrogen bonds: 

[0044] s = the number of hydrogen bonds to the Asn side chain 
C=0 group. Acceptable values are 0, 1, and 2. 

[0045] s = the number of hydrogen bonds to the Asn side chain 

NH group. Acceptable values are 0, 1, and 2. 

[0046] s = the number of hydrogen bonds to the backbone ni- 
trogen atom in the peptide bond on the COOH side of 
Asn. Hydrogen bonds counted in S or are not included. 
Acceptable values are 0 and 1. This nitrogen atom is used 
in the five-membered succinimide ring. 



[0047] s = additional liydrogen bonds, not included in S , S^, 
and S , that would need to be broken to form the succin- 

8 

imide ring. 

[0048] For Asn situated so that no a-helix, 3-sheet, or disulfide 

bridge structure is between the Asn and the end of the 
peptide chain: 

[0049] s = 1 if the number of residues between the Asn and 

10 

the nearest such structure is 3 or more. If the number of 
intervening residues is 2, 1, or 0, or Asn not between 
structure and chain end, then S =0. 

10 

[0050] If the Asn lies near to any a-helix, P-sheet, or disulfide 

bridge structures: 

[0051] s = the number of residues between the Asn and the 

11 

structure on the NH side, up to a maximum of 5. Values 

2 ^ 

of 0, 1, 2, 3, 4, and 5 are acceptable. 
[0052] s = the number of residues between the Asn and the 

12 

structure on the COOH side, up to a maximum of 5. Val- 
ues of 0, 1, 2, 3, 4, and 5 are acceptable. 
[0053] Hydrogen bonds selected by the Swiss Protein Data Bank 
(PDB) viewer were accepted if the bond length was 3.3 A 
or less and there was room in the structure to accommo- 
date the van der Waals radius of the hydrogen. In the 
computerized procedure this bond length was optimized 



at 4.1 A, and the bond angles and number of bonds per 
atom were adjusted to physically correct and optimized 
values. The Swiss PDB viewer, according to the customary 
criteria, selected a-helices and f-sheets. All primary 

g 

structure t values were those published , except for 

1/2 r I K 

Asn with carboxyl-side Pro, Asn, or Gin and N- 
glycosylated Asn. Estimated values were used for any se- 
quence for which the primary sequence rate was not 
known. 

[0054] Coefficients Used in Equation: 

[0055] Q values ("Coefficient of Deamidation") were optimizedby 

using various values for C and C to maximize the 

m S n 

value of the deamidation resolving power, D , as de- 
scribed in the calibration procedure section. The opti- 
mized values were C = 0.48, C = 1.0, C = 2.5, C 

m SI s 2 S3 

= 10.0, C =0.5,C =1.0, C =1.0, C =3.0, C 

S4,5 S6 S7 S8 S9 

= 2.0, C = 2.0, C = 0.2, and C = 0.7. 

S 10 S 11 S 12 

[0056] As an example, the 3-LysAsn(145)His sequence of 

hemoglobin is not in an a-helix or in a loop between two 
^sheets, so through = 0, S = 1. There is one hydro- 
gen bond to the amide side chain nitrogen and one other 
to be broken to form the imide, but there are none to the 
amide carboxyl or the backbone nitrogen, so S = 0, = 



1, S =0, and S = 1. This Asn is near tlie carboxyl end of 

8 9 

the chain and one residue from an a-helix on the amino 
side, so S = 0, S =1, and S =5. The GlyLysAsnHis- 

10 11 12 ' ' 

Gly half life^ is 10.5 days. Therefore, C = (0.01)(10.5)e^° '^^^^^^^^^^"'^^ 
= (0.105)e^°"^^^^^"^^ = (0.105)(16.184) = 1.70. 

[0057] Q js multiplied by 100 to give the predicted Tris deami- 
dation half-time in days for the amide. 

[0058] Results for Asn are greater than 95% correct in predicting 
the fastest amide in a protein. It is also applicable to Gin. 

[0059] It is also lil<ely that isomerization of Asp and Glu can be 

modeled with the same procedure. Primary rate data on 
Asp and Glu isomerization or a correction factor to be ap- 
plied to the Asn and Gin data is needed in order to do 
this. 

[0060] Conclusions: 

[0061] Three different types of modifications that can be used in 
the engineering of deamidation and/or isomerization 
rates of amides and possibly acids have been invented. 
These are: 

[0062] i_ Modification of the residues or residue-like structures 
on either side of the amide - principally the one on the 
right (carboxyl side). 

[0063] 2. Modification of the amide - specifically Asn to Gin or 



Gin to Asn, but other types of modification can also be 
used, especially in the case of structures that are similar, 
but not a perfect match to those found in peptides, hor- 
mones, and proteins. 

[0064] 3, Modification of the three-dimensional environment 
around the amide. The necessary modifications can be 
determined from the three-dimensional deamidation pre- 
diction method. Each of the S parameters describes a 
quantitative addition to the reaction activation energy. Re- 
moval or addition of one or more of these elements will 
change the rate accordingly. 

[0065] At least two types of deamidation are present. The ones 

on which this method is based, and which are most preva- 
lent for amides with half-times less than a few hundred 
days, depending on conditions and providing especially 
catalytic ions are not present, are most strongly effected 
by the structure to the right of the amide (e.g. in the se- 
quence GlyXxx(Amide/Acid)YyyGly the identity of Yyy is 
the most important factor). Also present is at least one 
more mechanism that is usually slower and has different 
sequence dependence. It is possible that this dependence 
as well as the left hand structure dependence (Xxx in the 
sequence GlyXxx(Amide/Acid)YyyGly) can also be modeled 



with a similar system, but tliis lias not yet been demon- 
strated. 



